Skip to content

test: Benchmarks#160

Merged
toby-coleman merged 18 commits into
mainfrom
test/benchmarking
Aug 28, 2025
Merged

test: Benchmarks#160
toby-coleman merged 18 commits into
mainfrom
test/benchmarking

Conversation

@toby-coleman
Copy link
Copy Markdown
Contributor

Summary

Adds a simple benchmark to the test suite.

Changes

  • Adds benchmark test on a simple 3-node model.
  • Reports benchmark results on each new PR, so that the code changes can be compared with main branch before merging.

Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Summary of Changes

Hello @toby-coleman, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request introduces a new benchmarking capability to the test suite. Its primary goal is to enable performance regression testing by adding a simple benchmark for a 3-node model and configuring the system to report these results on new pull requests, allowing for direct comparison against the main branch before merging.

Highlights

  • New Benchmark Test: Implemented a new benchmark test for a basic 3-node Plugboard model to measure process execution performance.
  • Automated Benchmark Reporting: Configured the test suite to automatically report benchmark results on new pull requests, facilitating performance comparisons against the main branch.
  • Dependency Integration: Integrated the pytest-benchmark library into the project's test dependencies.
  • Test File Structure: Created dedicated files for organizing benchmark tests within the tests/benchmark directory.
Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point in your pull request via creating an issue comment (i.e. comment on the pull request page) using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands.

Feature Command Description
Code Review /gemini review Performs a code review for the current pull request in its current state.
Pull Request Summary /gemini summary Provides a summary of the current pull request in its current state.
Comment @gemini-code-assist Responds in comments when explicitly tagged, both in issue comments and review comments.
Help /gemini help Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Footnotes

  1. Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution.

Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request introduces benchmarking to the test suite using pytest-benchmark, which is a great addition for performance tracking. The changes are well-structured, adding a new benchmark test for a simple 3-node model.

My review focuses on the implementation of the new benchmark test. I've suggested a refactoring to improve the clarity and conciseness of the test code, and to leverage pytest-benchmark's auto-rounding feature for more stable results. Overall, this is a valuable contribution to the project.

Comment thread tests/benchmark/test_benchmarking.py Outdated
@codecov
Copy link
Copy Markdown

codecov Bot commented Aug 23, 2025

Codecov Report

✅ All modified and coverable lines are covered by tests.

📢 Thoughts on this report? Let us know!

@plugboard-dev plugboard-dev deleted a comment from github-actions Bot Aug 23, 2025
@github-actions
Copy link
Copy Markdown

Benchmark comparison for bbfd6d1f (base) vs 4917f5c5 (PR)


---------------------------------------------------- benchmark: 1 tests ----------------------------------------------------
Name (time in ms)                   Min       Max      Mean  StdDev    Median      IQR  Outliers     OPS  Rounds  Iterations
----------------------------------------------------------------------------------------------------------------------------
test_benchmark_process_run     858.1550  877.9909  866.9117  7.7217  865.4655  11.3752       2;0  1.1535       5           1
----------------------------------------------------------------------------------------------------------------------------

Legend:
  Outliers: 1 Standard Deviation from Mean; 1.5 IQR (InterQuartile Range) from 1st Quartile and 3rd Quartile.
  OPS: Operations Per Second, computed as 1 / Mean

@plugboard-dev plugboard-dev deleted a comment from github-actions Bot Aug 23, 2025
@github-actions
Copy link
Copy Markdown

Benchmark comparison for bbfd6d1f (base) vs b5af6f72 (PR)


---------------------------------------------------- benchmark: 1 tests ---------------------------------------------------
Name (time in ms)                   Min       Max      Mean  StdDev    Median     IQR  Outliers     OPS  Rounds  Iterations
---------------------------------------------------------------------------------------------------------------------------
test_benchmark_process_run     867.2432  892.1093  875.9666  9.6154  874.9317  9.8430       1;0  1.1416       5           1
---------------------------------------------------------------------------------------------------------------------------

Legend:
  Outliers: 1 Standard Deviation from Mean; 1.5 IQR (InterQuartile Range) from 1st Quartile and 3rd Quartile.
  OPS: Operations Per Second, computed as 1 / Mean

@plugboard-dev plugboard-dev deleted a comment from github-actions Bot Aug 23, 2025
Copy link
Copy Markdown
Contributor

@chrisk314 chrisk314 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@toby-coleman toby-coleman merged commit c32f38b into main Aug 28, 2025
18 checks passed
@toby-coleman toby-coleman deleted the test/benchmarking branch August 28, 2025 12:15
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants